The Annals of Applied Probability 

2005, Vol. 15, No. 2, 1339-1366 

DOI: 10.1214/105051605000000034 

© Institute of Mathematical Statistics. 2005 

ON THE CONVERGENCE FROM DISCRETE TO CONTINUOUS 
TIME IN AN OPTIMAL STOPPING PROBLEM 1 

By Paul Dupuis and Hui Wang 

Brown University 

We consider the problem of optimal stopping for a one-dimensional 
diffusion process. Two classes of admissible stopping times are con- 
sidered. The first class consists of all nonanticipating stopping times 
that take values in [0,oo], while the second class further restricts the 
set of allowed values to the discrete grid {nh : n — 0, 1, 2, . . . , oo} for 
some parameter h > 0. The value functions for the two problems are 
denoted by V(x) and V h (x), respectively. We identify the rate of con- 
vergence of V h (x) to V(x) and the rate of convergence of the stopping 
regions, and provide simple formulas for the rate coefficients. 

1. Introduction. One of the classical formulations of stochastic optimal 
control is that of optimal stopping, where the only decision to be made is 
when to stop the process. Upon stopping, a benefit is received (or a cost is 
paid), and the objective is to maximize the expected benefit (or minimize 
the expected cost). Although the formulation is very simple, this optimiza- 
tion problem has many practical applications. Examples include the pricing 
problems in investment theory, the valuation of American options, the de- 
velopment of natural resources and so on [1, 2, 4, 5, 8, 11, 12, 19, 20, 21, 22]. 

The formulation of the optimal stopping problem requires the specification 
of the class of allowed stopping times. Typically, one assumes these to be 
nonanticipative in an appropriate sense, so that the control does not have 
knowledge of the future. Another important restriction is with regard to the 
actual time values at which one can stop, and here there are two important 
cases: continuous time and discrete time. In the first case, the stopping time 
is allowed to take values in the interval [0, oo], with oo corresponding to the 
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decision to never stop. In the second case, there is a fixed discrete set of 
times, and the stopping time must be selected from this set. Typically, this 
discrete set is a regular grid, for example, D h = {nh : n £ No U {oo}}, where 
ft, > is the grid spacing. 

In the present paper we focus exclusively on the one-dimensional case. 
Although a statement of precise assumptions is deferred to Section 2, a 
rough description of the continuous and discrete time problems is as follows. 

Continuous time optimal stopping. We use the stochastic process model 

^l = b(S t )dt + a(S t )dB t , 

where b and a are bounded continuous functions from M to M, and B is a 
standard Brownian motion. Although the results can be extended to cover 
other diffusion models as well, we focus on this model because of its wide 
use in optimal stopping problems that occur in economics and finance. We 
consider a payoff defined in terms of a nondecreasing function (ft : M. — > [0, oo). 
The payoff from stopping at time t is (f>(St), and the decision maker wants to 
maximize the expected present value by judiciously choosing the stopping 
time. This is modeled by the optimal stopping problem with value function 

V(x) = sup£[e- r XS r )|S* = x], 
res 

where r > is the discount rate and S is the set of all admissible stopping 
times. The stopping times are allowed to take values in [0,oo]. Let 

CV(x) = \a 2 (x)x 2 V"{x) + b{x)xV'(x). 

Then the dynamic programming equation for this problem is 

max[^(x) — V(x), CV{x) — rV(x)] = 0. 

If 4> is convex and nondecreasing, it is often optimal to stop when the pro- 
cess St first exceeds some fixed threshold x*. In this case, the value function 
V(x) equals 4>(x) for x > x*, and it satisfies the ordinary differential equation 
—rV(x) + CV(x) = for x < x*. 

Discrete time optimal stopping. The process model is the same as before, 
but the set of possible stopping times is restricted to those that take values 
in the time grid D h = {nh:n G No U {oo}}. The optimal strategy is often 
similar to the continuous time case: stop the first time S n h exceeds some 
fixed threshold x*. Let V h (x) denote the value function. The pair (V h (x),x^) 
satisfy the dynamic programming equation [25] 

i/Vtu/^Ii x€[a£,oo), 

W \ e -rh E[ yh {Sh)lSo=x]j xG ( ,4). 



CONVERGENCE OF OPTIMAL STOPPING 



3 



Closed-form solutions to this dynamic programming equation are not usually 
available. 

The aim of the present paper is to examine the connection between these 
two optimal stopping problems as h — ► 0. There are two questions of main 
interest: 

• What is the convergence rate of the optimal exercise boundary x\ to x*, 
and what is the rate coefficient? 

• What is the convergence rate of the value function V h (x) to V(x), and 
what is the rate coefficient? 

The goal is to use the more readily available solution to the continuous 
time problem to approximate the solution in discrete time. As we will see 
in Section 2, the optimal exercise boundaries converge with rate yh, while 
the value functions converge with rate h. In both cases there is a well- 
defined rate coefficient. The coefficient in the case of the exercise boundary 
is defined in terms of the expected value of a functional of local time of 
Brownian motion, while the coefficient for the value function involves both 
local time and excursions of Brownian motion. 

Few existing results are concerned with the rate of convergence of ap- 
proximations for this class of problems. Lamberton [17, 18] considers the 
binomial tree approximation for pricing American options and its gener- 
alizations in order to obtain upper and lower bounds (though not a rate 
of convergence) for the value function. The pricing of American options is 
equivalent to solving a finite-horizon optimal stopping problem, and there 
is no closed form solution. The goal in [17, 18] is, in fact, opposite that of 
the present paper, in that the discrete time problem is used to approximate 
the continuous time problem. 

As we have noted previously, the motivation for this study is to exploit 
situations where the continuous time problem can be more or less solved ex- 
plicitly (e.g., the one-dimensional problems considered in the present work). 
Our results allow one to explicitly compute accurate approximations for the 
discrete time problem, and thus avoid numerical approximation. Whether 
or not one can find a precise rate of convergence and rate coefficients for the 
analogous question in numerical approximation (where both time and state 
are discretized) is an interesting open question. 

The outline of the paper is as follows. In Section 2 we introduce notation 
and define the basic optimization problems. Two important universal con- 
stants are introduced in Section 3. In Section 4 we state the main result, 
give an illustrative example, and then lay out the main steps in the proof 
of the approximation theorem. The proofs of two key approximations which 
are intimately connected with the local time and excursions of Brownian 
motion are given in Section 5. The paper concludes with an Appendix in 
which (i) a result on a conditional distribution of the exit time is proved, 
and (ii) representations for the universal constants are derived. 
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2. Notation, assumptions and background. Consider a probability space 
(Q,J-,P;¥) with filtration F= (jF t ) satisfying the usual conditions: right- 
continuity and completion by P-negligible sets. The state process S = (St, Ft) 
is modeled by 

^L = h (St)dt + a{S t )dB u S = x. 
ot 

Here B = (Bt,J-t) is a standard Brownian motion. 
Define the continuous time value function 

V(x) = supE[e- rT <j){S T )\S = x], 
res 

where the supremum is over all stopping times with respect to the filtration 
F. Define the discrete time value function 

V h (x) = sup E[e- rT 4>(S T )\S = x], 
res h 

where S h is the set of all stopping times that take values in D h . 
The following assumptions will be used throughout the paper. 

Condition 2.1. 

1. The coefficients b : R — » R and a : R — > R are bounded and continuous, with 
inf a , e Rcr(x) > 0. Furthermore, xb(x) and xa(x) are Lipschitz continuous. 

2. </>:R— > [0, oo) is nondecreasing, and both <fi and its derivative (j) 1 are of 
polynomial growth. Furthermore, 

supe -r V(,S , t)eL 1 , lim e -rt 0(S , t ) = a.s. 

3. The "continuation" region for the continuous-time optimal stopping prob- 
lem takes the form {x:V(x) > <f>(x)} = (0,x*). 

4. The "continuation" region for the discrete-time optimal stopping problem 
takes the form {x : V h (x) > (j){x)} = (0, x*). 

5. The payoff function <fi is twice continuously differentiable in a neighbor- 
hood of x*. 

6. The smooth-fit-principle holds, that is, the value function V is C 1 across 
the optimal exercise boundary x*. 

As noted in the Introduction, V satisfies the dynamic programming equa- 
tion 

max[^(x) — V(x),CV(x) — rV(x)] = 0. 

Note that usually V is only once continuously differentiable across the 
optimal exercise boundary x = x*. Since <f>(x) = V(x) if x G [x*,oo) and 
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4>{x) < V(x) if x € (0,x*), it follows that V"(x*— ) > (j)"(x*), where the — 
denotes limit from the left. Define 

(2.1) A= — K > > 0- 

Although one can construct examples where ^4 = 0, as the next remark 
shows, the case A > is in a certain sense generic. We will assume this con- 
dition below, and note that the rate of convergence of the optimal threshold 
does not depend on A at all. 

Remark 2.1. The change of variable t = — logx can be used to trans- 
form the ordinary differential equation (ODE) Cf(x) — rf(x) = on (0,oo) 
into the ODE 

^(e-^W'it) + [i<7(e-*) - bie-'Wit) - rW(t) = 

on R. Since cr(x) > for x > 0, the classical theory for solutions of ODEs 
[3] can be used to show that the general solution to CV{x) — rV(x) = 
can be written in the form c\f\{x) + 02/2(3;), where fi(x) is positive and 
bounded as x [ and fy(x) is unbounded as x [ 0. Under Condition 2.1, the 
function f\ is twice continuously differentiable on (0, 00). V(x) is then equal 
to c\f\{x) for x G (0, x*] and equal to 4>(x) for x S [x*,oo), where c\ and x* 
are determined by the principle of smooth fit, that is, 

cih{x*) = 4>{x*) and cif[ (x*) = (j/(x*). 

Remark 2.2. If S is a geometric Brownian motion with b(x) = b and 
a{x) = a, and <^(x) = (x — k) + for some constant k, then Condition 2.1 holds 
when r > b [8]. For r <b, the value function for the optimal stopping problem 
is +00, and there is no optimal stopping time. For the boundary case r = b, 
the value function V{x) = x and there is no optimal stopping time. 

Remark 2.3. It is usually not a priori clear if parts 3 and 4 of Condi- 
tion 2.1 hold for a general state process. Counterexamples can be found in 
[6, 10]. Interested readers may also find the results in [7] helpful. 

Below we give a sufficient condition that is very easy to verify in the 
case 4>(x) = (x — k) + . Suppose parts 1 and 2 of Condition 2.1 hold, that 
derivatives of b and a exist and are Holder continuous for some 5 > 0, and, 
in addition, that 

r> sup {b(x) + xb'(x)}. 

x€(0,oo) 

We claim that parts 3 and 4 of Condition 2.1 hold. We will show that part 
3 holds and omit the analogous proof for 4. Now fix x>y. We have [15] 

Z t = St-S\= f X Dtdz, 
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where D* = dS* jdz satisfies the SDE 

= [b(S?) + S* t b\S z t )\ dt + [a(SD + S z t a\S z t )] dB u D* = 1. 

Note that D z and Z are both nonnegative processes. The condition on r im- 
plies {e~ rt Df} is a super martingale, and, therefore, so is {e~ rt Z t }. Observe 
that x>y implies 

4>{x) — <ft(y) = (x — k) + — (y — k) + < x — y. 

Thus, for any stopping time r G [0, oo], 

Ee- rT [ct>(S*) - 0(5?)] < E[e~ rT Z T } <Z = x-y, 

where the second inequality follows from the optional sampling theorem. It 
follows immediately that, for all x > y, 

V(x)-V(y)<x-y. 

This implies that 

{x : V(x) = cp(x)} = [x*, oo) 

for some real number x* . Indeed, if V(y) = 4>(y) = (y — k) + , then since V > 0, 
we must have y > k. It follows that, for all x>y, 

V(x) < V(y) + (x — y) = (y — k) + (x — y) = x — k = 4>{x). 

But V > (f> trivially, whence V{x) = 4>(x) for all x > y. This completes the 
proof. 

Remark 2.4. If 5 is a geometric Brownian motion with b(x) = b, and 
a{x) = <T, and 4>{x) = (52iAiX ai — k) + for some positive constants (Ai,cti) 
and k > 0, then one can show that V(x) — 4>(x) is decreasing, which in turn 
implies that parts 3 and 4 of Condition 2.1 hold. A similar argument can be 
found in [11, 14]. 

Remark 2.5. We wish to point out that part 6 of Condition 2.1 (i.e., the 
principle of smooth fit) is not an ad hoc assumption. Much research has been 
done on the validity of this principle under various conditions, especially for 
the one-dimensional diffusion case. Interested readers may find the list of 
references [6, 11, 14, 24] useful. 
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3. Two universal constants. In this section we introduce a pair of univer- 
sal constants that play an important role in determining the rate coefficients 
of the convergence. 

Let B be a standard one-dimensional Brownian motion. For a fixed con- 
stant u G [0, 1), define the process W = {Wt, t > u} by 

W t = B t -B u . 

In other words, W is a Brownian motion starting at time t = u with ini- 
tial condition 0. Let N = inf{ra G N : W n > 0} . Note that N is finite with 
probability one. Define 

(3.1) H(u) = EWn and M(u) = EW N . 
In terms of these functions, we define the constants 

(3.2) G = / H(u)du and T = [ M{u)du. 

Jo Jo 

Note that H{u) > M 2 (u), and, therefore, 

9 = J 1 H{u) du > ^ M 2 (u) du > (J 1 M(u) du^j = T 2 . 

Expressions (3.1) and (3.2) for G and T are useful for purposes of approx- 
imation (e.g., Monte Carlo simulation). However, the next lemma connects 
them with the quantities that will actually arise in the approximation of the 
optimal stopping problem. The proof of the lemma is given in the Appendix. 

Lemma 3.1. For every fixed u G [0, 1), 

rN 

H(u)=E l {Wt > Q} dt and M («) = EL% N (0) , 

J u 

where L^ N (0) is the local time ofW on the interval [u,N]. 

Remark 3.1. We have employed Monte Carlo simulation to obtain the 
approximations G ~ 0.589 and T ~ 0.582. 

4. The approximation theorem. Our main result is the following. 

Theorem 4.1. Assume Condition 2.1, and define the constants A, Q 
and T by (2.1) and (3.2). Assume that A > 0. The following conclusions 
hold for all x G (0, x*): 

1. 

V h (x)-V(x)_ I,,,, „ Q ^ 



V(x) 2 



-Axia z ( Xsf )(e-T z )h + o(h). 
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Tx*o (x* ) V~h + o(Vh ) . 



Example 4.1. Consider the special case where b{x) = 6 and <r(x) = a. 
Assume r > b and (j>{x) = (x — k) + for some constant k > 0. It follows that 
the value function for the continuous time optimal stopping problem is 

Bx a , x<x*, 

X Jv . X X % • 



V(x) 



where 

a= [ — ) +\H — ) B 

and 




According to the theorem, 



x 1 ! = 



x $ — A? . 
a — 1 



*(1 - Tav^) + o(v^) = — -(1 - TaVh) + o(Vh) 

a — 1 

and 

_ ^"(s*-) - ^'(s«) _ Ba(a - l)(x,) a ~ 2 

Jy' np i" 1 

which, after some algebra, yields 

V ^ X l- V ^ = -la(a - 1)(6 - rV* + o(h). 
V(x) 2 

4.1. Overview of the proof. In this section we outline and prove the main 
steps in the proof of Theorem 4.1. The proofs of two key asymptotic expan- 
sions are deferred to the next section. 

To simplify the analysis, we first introduce a bounded modification of the 
payoff function <f). This modification will not affect the asymptotics at all; 
see Proposition 4.2. 

Let <j) < <t> be an increasing function satisfying 

4>(x), if x < x* + a, 
k, if x > x* + 2a. 

Here a and k are two positive constants, whose specific values are not im- 
portant. Without loss of generality, we assume that cf> is twice continuously 
differentiable in the region [x*,oo). Suppose h and 5 are two positive con- 
stants, and let x$ = x* — 5. We consider the quantities 

W 5 (x)=E x [e- rTS 4>(S TS )] and W 5 (x) = E x [e~ rTS <P(S TS )i 



(4.1) ${x) 
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where 

t s = inf{i > 0:S t > x s }, 

and E x denotes expectation conditioned on Sq = x. Note that Wg(x) = 
Ws(x) for all x < x$. We also define 

W s h (x) = E x [e- rr ^(S T H)] and W s h (x) = E x [e^ fy(S 

where 

Tg = mi{nh > : S nh > xs}. 

These are all cost functions for an a priori fixed (and possibly suboptimal) 
stopping region. 

Main idea of the proof. The main idea for proving the rates of conver- 
gence is as follows. Write 

W s h (x) - V(x) = [W s h (x) - W s h (x)} + [W s h (x) - W 5 (x)} + [W 5 (x) - V(x)}. 

For each term, we will obtain approximations as h and 5 tend to zero. It 
turns out the leading term for the sum has the following form: 

— \a\5 2 + a2§Vh + 03/1 + higher-order term. 

Here ai,a2 and 03 are constants with respect to 5 and h (though some will 
depend on x) and with a\ > 0. Since x\ is the optimal exercise boundary for 
the discrete problem, the mapping 5 — > W^(x) attains its maximum at 5* = 
Hence, one would expect that 5* would approximately maximize 
the leading term, or 

(4.2) 5, = —Vh + o(Vh). 
Substituting this back in, one would further expect 

V h (x) = Wl (x) = (-^ + a 3 y + o(h). 

Thus, we obtain the precise asymptotic behavior of both the stopping regions 
and the value functions once the quantities a±,a2 and 03 are determined. 

This is, in fact, how the argument will proceed. We begin with the esti- 
mation of the first term, which turns out to be negligible for small h and 5. 
Define the quantity 

(4.3) A U = W?(x) - W*(x) = - 0(5 ,))]■ 
We have the following result. 



10 P. DUPUIS AND H. WANG 

Proposition 4.2. Define As,h by (4.3). There exist constants L < oo 
and e > such that 

\A s , h \<Le- £ / h 
for all sufficiently small 5 and h. 

Proof. The proof is based on the following bound. Let a be as in the 
characterization (4.1) of (p. Then for any x < x*, y > x* + a, and h > 0, we 
have 

(4.4) P(S h >y\S = x)<exp{-- 

where the positive constants c±,C2 depend only on the coefficients b, a. The 
proof of this inequality is a standard application of exponential martingales 
[23], and thus omitted. 

We now complete the proof of the proposition. To ease the exposition, we 
use r in lieu of rj 1 throughout the proof. We have 

oo 

A s,h = £ e~' rnh E x U{S nh ) - 4>{S nh ))t {T=n} ] 

71=1 

POO 

< £ e~ rnh / W{y) - 4>'(y)\P x {S nh >y,r = n) dy. 

n=l Ja+X * 

Fix an n € N. Define the stopping time a = inf{t > (n — l)h:St > x$}. 
Then 

P x {S nh >y\T = n) 

= P x {S nh > y\a = nh-t, S nh > x s , S( n _i) h <x s ,...,S < x s ) 

x P x (a £nh- dt\a < nh, S nh > x s , S^-^h <x s ,...,S < x s ). 

However, the strong Markov property implies, for all t 6 [0,h], that 
P x (S n h > yW = nh-t, S nh > x s , S( n _i) h <x s ,...,S < x s ) 
= P(S t > y\S = x s , S t > x s ) 
= P(S t >y\S = x s )/P(S t > x s \S = x s ). 

The denominator in this display is uniformly bounded from below away from 
zero, for all t S [0, 1] : 

P(St > x$\Sq = xs) > a > 0. 



log 
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(See the proof of Lemma 5.5 for the detailed calculations in an analogous 
setting.) Using (4.4), for all small h > and t E (0, h), 



P(S t > y\S = x s ) < exp 



1 

c 2 t 

< exp<j — 

c 2 h 



log- 



X $ 



at 



log c\h 

X 



Now since (j)' is of polynomial growth and 4>'{x) is zero for large x, it 
follows that there are finite constants R and m such that 

W(v) - 4>'{y)\ < Ry m ~ l for all y > x* + a. 
Hence, for all small 5 > 0, the change of variable x = log(y/x*) — c\h gives 



<h <-Y j e- rnh P x {r = n) 



y™" 1 exp 



a+x* 



1 

c 2 h 



log — — c\h 



dy 



-(x*) m e mClh Y e- rnh P x (T = n) / 
« ~x ^log(l+o/ 



3 mx-x 2 /(c 2 /i) ^ 



i/x,)- Clft 

Let $ be the cumulative distribution function for the standard normal dis- 
tribution. For h small enough, there exists positive numbers a,C,c such 
that 



A 5 , h <Cj2e- rnh P x (T = n) 



n=l 

<C , V / 2^-$ 



i 



/ ch 
1 



log 1 + 



log(l+a/x*) 

a 



-rn/i ni 



P*(r = n) 



n=l 



'ch 



log 1 + — 



■Vh. 



We complete the proof of the proposition by using the asymptotic relation 



$(— x) 



1 



-x 2 /2 



2ttx 



asu oo. □ 



The bound just proved shows that [Wj 1 (a;) — W^(x)] is exponentially small 
as h — > 0, uniformly for all small 5 > 0. We now consider the terms [Wg(x) — 
Wg(x)] and [^(x) — V(x)]. When considering the asymptotic behavior of 
these terms, it is often convenient to scale 5 with h as h — > in the manner 
suggested by (4.2). For the remainder of this proof, unless explicitly stated 
otherwise, we will assume that 



(4.5) 



5 = c^fh + o(Vh ) as h — > 
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for a nonnegative parameter c. With an abuse of notation, the quantities 
Wg(x) and W$(x) will be denoted by W^(x) and W c (x) when the relation 
(4.5) holds. 

We next estimate [W c (x) — V(x)] as h — > 0. 

Proposition 4.3. Assume Condition 2.1 and de/me A by (2.1). As- 
sume also that A > 0. XTien 

Wc(x) - V(x) = \-\Ac 2 h + o(/i)]y(x). 

Proof. Recall that V(x) can be characterized, for x < multiple 
of the bounded (in a neighborhood of zero) solution f\ to Cf(x) — rf(x) = 0; 
see Remark 2.1. ^(i) can be likewise characterized, with the constant 
determined by the boundary condition W c (xg) = (j)(x$). Thus, 

WJx) = ^\v(x) for all x£ (0,x s l 
V(x s ) 

We now apply Taylor's theorem for small 8 > 0, and use x<j = x* — 5, T^(x*) = 
0(x*),y'(x*) = </>'(x*), and the definition of A to obtain 

The proof is completed by using (4.5). □ 

In the next proposition we state the expansion for [Wg(x) — Wg(x)]. This 
estimate deals with the critical comparison between the discrete and contin- 
uous time problems. The proof of this expansion is detailed, and therefore 
deferred to the next section. 



Proposition 4.4. Assume Condition 2.1 and define A, and T by 
(2.1) and (3.2). Assume also thatA>0. Then 

Wj?(x) - W c (x) = [Tx*a(x*)Ach - ±AOxla 2 {x*)h + o(h)]V{x). 

Proof of Theorem 4.1. Recall that x\ is the optimal boundary for 
the stopping problem with value function V h . On the stopping region, we 
always have V h (x) = <j>{x). Also, since V (x) is defined by supremizing over 
a subset of the stopping times allowed in the definition of V(x), it follows 
that V h {x) < V(x). Since V(x) > (j)(x) for all x, it follows that x^ < a;*. 

According to Propositions 4.2, 4.3 and 4.4, for each fixed c S [0, oo), 

— -Ac 2 h + Tx*a(x*)Ach — -AQxlcr 2 (x*)h + o(h) . 



W?{x) - V[x) 

vTx) 



CONVERGENCE OF OPTIMAL STOPPING 13 

This suggests that the choice c* = rx*a(x*) should define the maximizer and 
also (at least approximately) the boundary of the optimal stopping region. 
Inserting this into the last display gives 



W*(x)-V(x) 



^A(T 2 -Q)xla{x if fh + o{h) 



V{x) 

and since V h {x) > W*(x), it follows that 

Now define c h by x\ = x* — c h ^fh. Since x^ < x*, we know that c h G [0, oo). 
By taking a convergent subsequence, we can assume that c^ce [0, oo . 
Using an elementary weak convergence argument, one can show that x^ — > 
x*. First assume that c£ (0,oo). If Tx^a^x^), then by Propositions 4.2, 
4.3 and 4.4, we have 

V h (x)-V(x) 1 Af n ^, 2 , n2 

limsup \ K 1 < -A{T 2 -<d)xla{x*)\ 
MO V(x)h 2 

which contradicts (4.6). If c = oo, then Propositions 4.2 and 4.4 and an 
argument analogous to the one used in Proposition 4.3 shows that 

Since (c h ) 2 — > +oo, this again contradicts (4.6), and, thus, c = Tx*a(x*). We 
extend to the original sequence by the standard argument by contradiction, 
and Theorem 4.1 follows. □ 



5. Approximations and expansions in terms of local time and excursions 
of Brownian motion. In this section we prove Proposition 4.4, which is 
the expansion W$ (x) — Wg(x) for small h > 0. Throughout this section we 
assume (4.5), which we repeat here for convenience: 5 = c\fh + o(Vh) as 
h — > 0. Recall also that subscripts of c and 5 may be used interchangeably 
under this condition. Define the error term 

£(x)=E x [e- rh Ws(S h )-W 5 (x)} Vx<x s . 

Observe that Wg has the representation 

W*M = ($( X ')> fovx>x s , 
vv SK x) \_ £ ( x ) +e -rh E tfy s (S h )\So = x], forx<x s . 
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It follows from the generalized Ito formula that 
e- rh W s (S h ) - W s (x) 

(5.1) = f 'e- rt l-r0(S t ) + C4>(S t )]t {St > Xs} dt 

J 

+ AWfa s ) [ h e- rt dL?(x s )+ [ k e- rt W' s (S t )S t a(S t )dW t . 
Jo Jo 

Here L s is the local time for the process S, and 

AW' & {x 5 ) = W' 5 {x 5 +) - W' 5 {x 5 -). 

It is straightforward to prove that the stochastic integral has expectation 
zero, and we arrive at the following result. 

Lemma 5.1. For every x € (O,^), 

e(x) = E x £ e - rt [-r4>(S t ) + £$(S t )]l {St > Xs} dt 

+ E x AW' 5 {x s ) f h e- rt dL?(x s ). 
Jo 

Let Tg = inf {nh: S n h > x$}. Then the discounting and the lemma just 
stated imply the formula 

W s (x) = -E x V e- rnh e(S nh ) + E x e- rT ^(S Th ) 

* — • 5 

n=0 

= -E x r e- rt l-r4>(S t ) + C4>(S t )]l {St > Xs} dt 

J 

- AW' s {x s )E x / e~ rt dLf(xs) + E x e~ rT s $(S t h) 
Jo s 

for all x < x$. Observe that the last term is precisely Wg(x) by definition. 
It follows that 

Wfc)-W s (x)=E x r e- rt [-rt(S t )+W(S t )]l {St > Xs} dt 

J 

+ AW' 5 {x s )E x [ T ° e- rt dL s t (x 5 ). 
Jo 



The proof of Proposition 4.4 is thereby reduced to proving the following 
two results. 
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Proposition 5.2. Assume Condition 2.1 and define A and 9 by (2.1) 
and (3.2). Assume also that A > 0. Then 

E x r e- rt [-r4>(S t ) + C0(S t )]t {St > X5} dt = [-\AQxW{ Xif )h + o(h)]V(x). 

J 

Proposition 5.3. Assume Condition 2.1 and de/ine A and T by (2.1) 
and (3.2). Assume also that A > 0. T/ien 

(5.2) AW' 5 {x 5 )E x e- rt dLf(x 5 ) = [Tx,a(x,)Ach + o(h)]V(x). 
Jo 

The proofs of Propositions 5.2 and 5.3 use estimates on the excursions 
and local time of Brownian motion, respectively, and are given in Sections 
5.1 and 5.2. 

5.1. Proof of Proposition 5.2. We recall Lemma 3.1, which states that 

rN 

H{u) = EW 2 N = E t {Wt > 0] dt, 

J u 

for arbitrary u S [0, 1). Here W = {Bt — B u ,t > u) is a Brownian motion 
starting at time t = u with initial condition W u = 0, and N = inf{n £ 
N:W:„>0}. 

Lemma 5.4. H{u) is continuous and bounded on the interval [0,1). 
Proof. Define 

rN 

Z u = t{w t >o} dt. 

Ju 

We first show that the family {Z u ,u G [0,1)} is uniformly integrable [and, 
in particular, that H(u) is bounded]. Indeed, define 

A rj + l 

cq= t{w t >o} dt, cj= %{w t >o}dt, ieN. 

Ju Jj 

The key observation is that if Cj > 0, then W must spend some time during 
the interval [j,j + 1] to the right of zero, therefore, the probability that 
Wj+i > is at least half. Thus, for all j G No, 

P(N = j + l\N>j, Cj >0)>l 

Let X u = J2f=()~ ^{cj>o\- Clearly, X u dominates Z u . Furthermore, the strong 
Markov property implies that 

P(X u >j + l\X u >j)<l-\ = \. 
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This, in turn, implies that P{X U > n + 1) < 2~ n , and, thus, 

oo oo 

E{Zl) < E{X 2 U ) = £ 2nP(X u > n) < £ ^ < 00 ■ 

Therefore, {Z u ,u€ [0,1)} is uniformly integrable. 
As for the continuity, we write 

rN 



H(u) = E / ± {Bt _ Bu > oy dt = EZ U . 

J u 



Fix any u € [0, 1) and let {u n } be an arbitrary sequence in [0, 1) with u n — > u. 
Since for P(B n — B u = 0) = for every fixed n, we have Z Un — ► Z u with 
probability one. Since the Z Un are uniformly integrable, H(u n ) — > H(u). 
This completes the proof. □ 

Now for any u E [0, 1) and h > 0, define the function 

1 r N ' lh , 

F(h;u) = -E / e^-^l-r^St) + C$(S t )]l {St > Xs} dt, 
n Juh 

b(St)dt + a(S t )dB t , S uh = x 5 



where 

dSt 
St 
and 



N h = mf{n£N:S nh >x s }. 

Let [a\ denote the integer part of a. It follows from strong Markov property 
that 

E x p e- rt l-r0(S t ) + C0(S t )]l {St > Xs} dt 

(5.3) 



hE x 


e~ rT6 F^h;j- 


n 


)] 











Consider the change of variable t^th and the transformation 

v (h) ^_ Sth ~ x s 

We can rewrite 

F(h;u)=E J e- r ^ h [-r^ + C4>](VhY t (h) + x d )l {Y ^ 0} dt, 

where follows the dynamics 

dY t {h) = (VhY t (h) + x 5 ) [Vhb(VhY t (h) + x s ) dt + a{VhY t (h) + x s ) dB t 



CONVERGENCE OF OPTIMAL STOPPING 



17 



with initial condition = 0. 

We have the following result regarding F(h;u). Although part of the proof 
is similar to that of Lemma 5.4, we provide the details for completeness. 

Lemma 5.5. 1. F(h;u) is uniformly bounded for small h and u G [0, 1). 

2. 

lim F(h;u) = [-rij> + C<i>]{x*)H(u), 

h— >0 

and the convergence is uniform on any compact subset of [0, 1). 

Proof. Consider the family of random variables {Zh tU :u G [0,1), h G 
(0, 1)}, where 

Z h , u = J e~ r ^ h [-r4> + m(VhY t W + x s )l {Y ( h)m dt. 

We first show this family is uniformly integrable. Since (— rc/) + C(j)) is bounded, 
it is sufficient to show that 

(5-4) X hiU = J l {Y < h)m dt 

are uniformly integrable. Define 

c ( h) = £ V/^o} dt > c f = hr t w >o} dt > 3 e N - 

As in the proof of Lemma 5.4, if Cj > 0, then spends some time to 
the right of zero in the interval [j, j + 1] . We claim that the probability of 
Yj!^i > is bounded from below by a positive constant: 

P(N h =j + l\N h >j,cf ] >0)>a>0 VnG [0, 1), h G (0, 1). 
To this end, it suffices to show that, for some a > 0, 

pt, h = P(Y t {h) > 0\Y {h) > 0) > a > Vi G [0, 1]. 
However, it is easy to see that 

Pt,h = P(S t h > %s\So > x s ) 

(r nth nth -< 

exp|y o [b(S u ) - ±a 2 (S u )} du + ^ a(S u ) dB u \ > 1 

p(j\(s u )dB u > cl thy 



> 
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where c\ = ||6||oo + ||c" 2 ||oo/2. We can view the stochastic integral Qt = 
Jo &(Su) dB u a time-changed Brownian motion. Indeed, there exists a Brow- 
nian motion W such that [13] 



Let 



Then 



Qt = w {Q)t . 

a = inf cr(x), a = supa(x) 

X X 

v 2 h < (Q) t < a 2 h. 



It follows that 



Pth>P\ min W s >c 1 th)=P[ min W s > ciVth ) , 

T 2 th<s<a 2 th J \a 2 <s<a 2 



where the last equality follows since {Wths/vth, s > 0} is still a standard 
Brownian motion. For h G (0, 1) and t € [0, 1], we can choose 



a = P[ min W s > c\ > 0, 

\<?<s<a 2 ~ ) 

which will serve as a lower bound. 
Now define 

M hu= 1 { c w >oy 

3=0 3 

which clearly dominates Xfi u . By the strong Markov property, 
P(M hiU >j + l\M h , u > j) < 1 - a, 

and, thus, 

p(M h}U >j)<(i- a y-\ 

This implies that 

oo oo 

which implies the uniform integrability of {Zh jU , u£ [0, 1), h € (0, 1)}. In par- 
ticular, F(h;u) is uniformly bounded for u G [0, 1) and h G (0, 1). 

For the uniform convergence, it suffices to show that, for any u G [0, 1) 
and any sequence u h G [0,1) converging to u, 

F(h;u h ) = EZ h h - [-r0 + £$](x*)H(u). 
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Let yC 1 ) be the process with = 0. As h —> 0, we have that y^O converges 
weakly to Y, where Y is defined as 

Y t = x*a{x*){B t - B u ). 

By the Skorohod representation, we can assume Y^> — ► Y with probability 
one. Using the uniform integrability, it suffices to show that 

z h, u h ~^Z= [-rep + / l{y t > } dt 

J u 

with probability one. Note that iV is almost surely finite, and that N h — ► iV 
with probability one. The almost sure convergence of Z h u h to Z then follows 
from the dominated convergence theorem, which completes the proof. □ 



lim E x 




T6 


)] 











Returning to the proof of Proposition 5.2, we claim that 

Q[-r4> + C$\{x*)E x [e- rT *) 



h 



To ease notation, let 



It suffices to show that 



lim E x [e- rT5 F{h;U h )}=®[-r4> + C$\{x*)E x [e- rT *}. 
h—*0 

In Proposition A. 3 in the Appendix we show the following (not very sur- 
prising) result. As h and 5 tend to zero, (rs,Uh) converges in distribution 
to (t*,J7), where U is uniformly distributed and independent of r*. More 
precisely, we have 

E x [e- rTS H(U h )] -► E x [e- rr *\ ^ H(u) du = ®E x [e~ TT *}. 

Jo 

Therefore, to prove (5.5), we must show that 

A = E x [e- rT6 F(h;U h )} - [-r0 + £$(x*)E x [ e - rT * H{U h )] - 0. 

Due to the uniform boundedness of F and H, there exists R £ (0, oo) such 
that 

\F(h,u)\ + \[-r$ + £$\(x*)H(u)\<R VuG [0,1), 
when h is small enough. Since Uh U, for h small enough, 

P(U h >l-e)<2e. 
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Also, by Lemma 5.5 for h small enough, 

sup \F(h,u)-[-r$ + £$\(x*)H(u)\<e. 

ue[0,l-e] 

It follows that, for h small enough, 

A < eP(U h < 1 - e) + RP(U h >l-e)<(2R + l)e, 

which completes the proof of (5.5). 

It follows directly from the definitions of V(x) and r* that 

(5.6) E x [e- rr *]=V{x)/0(x*). 

Also, the definition of A in (2.1) and the fact that (— rV + CV){x*— ) = 
imply that 

(—r4> + £4>)(x*) 

= (-rV + CV)(x,-) + ±a 2 (x*)x 2 S"(x*) - V"(x,-)] 

= ±a 2 (x*)x 2 S"(x*)-V"(x*-)} 

= |ct 2 (x*)z*A/>(:e*). 

Proposition 5.2 follows by combining the last display with (5.3), (5.5) and 
(5.6). 

5.2. Proof of Proposition 5.3. We recall the notation x$ = x* — S, where 
S = c\[h + o(s/h). It follows from the definition (2.1) of A and Taylor's 
theorem that 

AW^x s )=^(xs)-P^- ) V'(x 5 ) 

= [V"(x*-)-0"(x*)]5 + o(5) 

= cA(/)(x*)Vh + o{Vh). 
As a consequence, the main difficulty in proving (5.2) lies with the term 



E x [ T * e- rt dL s t {x 5 ). 
Jo 



As in Section 5.1, we consider the transformation 

v (h) ^_ Sth ~ xs 

Then Y"C0 satisfies the SDE 

dY t {h) = (VhY t (h) + x & )[Vhb{VhY t {h) + x s ) dt + a{VhY t {h) + x s ) dB t ]. 

We have the following lemma, whose proof is trivial from the definition of 
the local time and thus omitted. 
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Lemma 5.6. Suppose X is a semimartingale, and Yt = aX^t + v, where 
a > 0, b > 0, v are constants. Let L Y and L x denote the local times for 
Y and X , respectively. Then, for all t>0, 

Lj (ax + v) = aLfc (x) . 

It follows from the lemma that 

E x / 5 e~ rt dLf{x s ) = VhE x / e- rth dL^ (0). 
Jo Jo 

For any ti£ [0, 1), define the process 

Y t * = x*a{x*)B t , y; = o. 

Also define 

Q(u) = ELl* N (0) where N = inf{n G N : Y* > 0}. 
We have the following result. 

Proposition 5.7. 

pN h rl 

limE x e- rth dLY (h \0)=E x [e- rT *} Q{u)du. 
ft->o Jo Jo 

Before giving the proof, we show how the desired Proposition 5.3 will 
follow from Proposition 5.7. We have E x e~ rT * = V(x)/(j)(x*), and the defini- 
tions of Q and M, and Lemma 3.1 imply Jq 1 Q(u) du = x^a(x if ) Jq 1 M(u) du. 
When combined with the expansion given above for AWi(x$), the left-hand 
side of (5.2) is equal to 

V(x) f 1 
hcA(f)(x*)^-. — rx*a(x*) / M(u) du + o(h), 
(p[x*) Jo 

which is exactly the right-hand side of (5.2). 

Proof of Proposition 5.7. We consider the test function 

r o, if x < o, 

f{x) = lx, if 0<x<l, 
[k, ifx>2. 

We require f(x) to be increasing and smooth, except at the point x = (the 
specific choice of k is not important). It follows from the generalized Ito 
formula and the integration by parts formula that 

d[e~ rth f(Y t {h) )} = -rhe- rth f{Y t {h) ) dt + e~ rth D- f{Y t (h) ) dY t {h) 

+ \e- rth f"{Y t {h) )dY t {h) ■ dY t (h) + e~ rth dhf h) (0). 
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Without loss of generality, we let f"(0) = 0. Now we integrate both sides 
from to N h and take expected value. 

The first term on the right-hand side will contribute 

-rhE x e- rth f{Y t (h) )dt = -rhE x e~ rth f(Y t W ) dt, 

JO Jr s /h 

since f(x) = for x < 0. We recall the definition (5.4) of Xh )U . It follows 
from the strong Markov property that 



rN h fN h 

E x / e- rth f(Y t {h) )dt<kE x / 1 w dt = kE x G(h,U h ), 

Jr s /h Jr s /h i Y t ^ 



where 



Uh ~h 



h 



and 

G(h,u)=EX h>u . 

By the uniform integrability of X^u f° r small h and u £ [0, 1), E x G(h, Uh) is 
uniformly bounded for small h. Therefore, the expectation of the first term 
in the right-hand side goes to zero as h 0. 

The second term in the right-hand side contributes (observe that the 
stochastic integral has expectation zero) 

VhE x / e~ Tth D~f(Y t {h) ){VhY t {h) + x 5 )b(VhY t {h) + x&) dt. 
Jo 

Note that the integrand is bounded by l /v C)^n\ U P *° a proportional con- 
stant. It follows exactly as in the case of the first term that the contribution 
of the second term goes to zero. 

The third term in the right-hand side contributes 

E x / ±e- rth f"(Y t ih) )(VhY t {h) + Xs ) 2 a 2 (VhY t W + x s ) dt. 
Jo 



Since f"(x) = for x < 0, the expected value equals 

'rs/h 



E x / ±e- rth f(Y t W )(VhY t {h) + x 5 ) 2 a\VhY t {h) + x s ) dt. 
Jralh 



It follows from strong Markov property that the expectation can also be 
written 

E x [e- r ^F(h;U h )}, 
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where 

F(h;u)=E J ie-^Y'^^V^^+x^VCV^y^+x*)!^)^* 

and where satisfies the same dynamics with Yu = 0. Since the inte- 
grand is bounded due to the fact that f"(x) = for all x > 2, it follows from 
an analogous argument to the one given in the proof of Lemma 5.5 that: 

1. F(h; u) is uniformly bounded for small h and all u £ [0, 1); 

pN 

J(u) = \mxF(h;u) = \E \ f"{Y t *)xla 2 { Xif )dt 

and the convergence is uniform on any compact subset of [0, 1). 

The uniform convergence (on compact sets) of F and Proposition A. 3 in the 
Appendix imply that the expectation of the third term converges to 

E x [e- rr *} f 1 J{u)du. 
Jo 

We omit the details here since an analogous argument is used in the proof 
of Proposition 5.2. 

It remains to calculate the contribution from the term 

E *[e-^f{Y$)} = E*[e- r ^K(h, U h )}, 

where 

K{h-u) = E[e-^ Nh - u >f{Y$)\ 

with Y^ = 0. However, the boundedness and continuity of / ensure the 
following: 

1. K(h;u) is uniformly bounded for all h and all u £ [0, 1). 

I(u) = hmK(h;u) = E[f(Y£)\Y: = 0] 

h— >0 

and the convergence is uniform on any compact subset of [0, 1). 

Indeed, the first claim is trivial. As for the second claim, let u h — > u. Then 
as h — ► 0, Y^ =>■ Y*. By the Skorohod representation theorem, we can as- 
sume 

y(h) _> y* with probability one, which also implies that — > N with 
probability one. Therefore, Y^h ~^ Y^ with probability one. The claim now 
follows from the dominated convergence theorem. Similarly, 

E*[e~^f(Y$)] -+ E x [e- rT *] £ I(u) du 
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as h — > 0. It is now sufficient to prove 

I(u) - J(u) = Q(u) VuG [0,1). 
This is the same showing 

r N 

f{Y*N) - h / f"{Y;)xla\x*) dt - L^UO) 



o, 

J u 

where 

Y t *=x*a(x.)W t , Y* = 0. 

But this is a direct consequence from the generalized Ito formula and we 
complete the proof. □ 

APPENDIX 

A.l. Weak convergence of (T$,Uh). For an arbitrary y > 0, define the 
function 

P y (x : t) = P[ max S u > y\So = x ) . 

\ 0<u<t J 

We have the following lemma. 

Lemma A.l. For every fixed y > 0, function P y G C 1,2 ((0,y) x (0,oo)) n 
C((0,y) x [0,oo)) and satisfies the parabolic equation 
Qpy 



dt 



■(x,t) + CP v (x,t) = 0, (x,t) G (0,y) x (0,oo). 



Proof. It follows from a standard weak convergence argument that P y 
is a continuous function; see, for example, [16]. Let (xo,io) £ (0)2/) x (0, oo) 
and define the region 

D = (x - e,x + e) x (t -e,t ). 
Consider the parabolic equation 
du 

- — (x,t) + Cu(x,t) = 0, (x,t)eD, 

with boundary condition u = P y on its parabolic boundary. It follows from 
standard PDE theory that there exists a classical solution u [9]. It remains 
to show that u = P y in the domain D. Define the stopping time 

T = M{t>0:{t o -t,S t ) £D}. 

It follows that the process u(St,to — t) is a (bounded) martingale. In partic- 
ular, 

u(x , t ) = E xo u(S T ,t - t) = E X0 P y {S T , t -r) = P y (x , t ). 
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Here the last equality follows from the strong Markov property. □ 

For fixed < x < y, the density of the hitting time T y is defined as 

gpy 

p y( x ,t) = —(x,t). 

According to the preceding lemma, p y is continuous in the domain (0, y) x 
(0,oo). 

Lemma A. 2. Suppose y n ^ y*, then P Vn (x,t) — > P y *(x,t) and p Vn (x ,t) - 
p y (x,t) uniformly on any compact subset o/(0,y*) x (0,oo). 

Proof. It suffices to show that P y "(x,t) — > P y (x,t) uniformly on any 
compact subset. The uniform convergence of p Vn then follows from [9], Sec- 
tion 3.6. Suppose D = [xq,xi] x [to,*i] ^ (0,y*) x (0,oo) is a compact subset. 
In the following, we will denote P Vn and P y * by P n and P, respectively. Also, 
we assume y n < y* for all n, which implies that 

Pn(x,t)>P( X ,t). 

An analogous argument can be used for the case y n >y*- 
For any e > 0, we want to show that, for large enough n, 

0<P n {x,t)-P(x,t)<e V(x,t)eD. 

Define 

r = inf{t >0:S t > y*}; r n = inf{t > : S t > y n }- 

Since P is continuous, it is uniformly continuous on the compact subset D. 
It follows that there exists a number h such that 

P(t < t < t + h\S = x) = P(x, t + h)- P(x, t) < | V (x, t) G D, 
and, thus, for all (x,t) £ D, 

P n (x,t) - P(x,t) = P x (r n <t,T>t)< P x (r n <t,r>t + h) + ^. 
However, it follows from strong Markov property that, for any (x,t) E D, 
P(T n <t,T>t + h\S = x)<P( max S t < y*\S = y n ) ■ 

\0<u<h J 

Note that the right-hand side is independent of (x, t) £ D. A proof analogous 
to that of Lemma 5.5 yields that the right-hand side is dominated by 



P 



( max 


'- C M + B~ t 


< log 


y*\ 


\0<t<a 2 h 


a_ 




yn) 



For n big enough, this probability is, at most, e/2 since y n — > y*. This com- 
pletes the proof. □ 
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Proposition A. 3. Suppose f : [0, oo) — > M is a bounded continuous func- 
tion with 

lim f(x) = 0, 

and g : [0, 1) — > M a continuous, bounded function. Then 



lim £ a 



/? 



jo 



/or aZZ x € (0, cc*). 



Proof. Fix x G (0,x*). Let and p denote the density of t<5 and r*, 
respectively. We can assume that all xs are close to x*, in the sense that 
5 < 5q for some Sq, and x < xa. Since f{x) — > as x — > oo, we have 



/? 



/? 



oo 



P<5(s)ds. 



For any e > 0, there exists < a < M < oo such that 



J[0,a] \h 



Ps(s)ds < 



< 

< e 



\g\\ooP{Ts < a) 
\g\\coP{T8 < a) 



and 



m 9 ( , 

[M,oo] V ft 



Ps(s)ds < max|/(x)| • \\g\loo < e. 

M<x 



Note that such choices of (a, M) also make the above inequalities hold when 
ps is replaced by p. Also, since ps — > p uniformly on the compact interval 
[e, M], we have 



/(*)«/ 



h 



\ps(s) ~p(s)\ ds<e 



for 5 small enough. It remains to show that, for h small enough, 
f(s)g[r- t )p(s)ds- I f(s)p(s)ds- ^ 



M r-1 

f{s)p(s)ds- / g(u)du 



<£. 



We omit the rather straightforward proof, which follows easily from the 
uniform continuity of /, p, and / ■ p on compact intervals. □ 
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A. 2. Proof of Lemma 3.1. We first prove the representation for H{u). 
Consider the continuously differentiable convex function 

0, ifx<0, 



f(z)-2\* > ~\x 2 /2, ifx>0. 
It follows from the generalized ltd formula [13] that 

rNAn rNAn 

f(W NAn ) = / W t l {Wt >o} dWt + l / 1 {w t >o} dt 

J u J u 

for all integers n £ N. This yields 

i-NAn 

E(W+ An ) 2 = E t {Wt >o } dt VnGN. 

Ju 

Letting n — > oo, the right-hand side converges to E l{ Wt>0 y dt by the 
monotone convergence theorem. Since W^ An < Wjy, the result will follow 
by dominated convergence if one can show that EW^ is finite. 
To this end, we observe that 

oo 

EW 2 N = J2E(W 2 N l {N=n} ) 



n=l 

oo 



' — ' t>oo 

y/ 2xP(W N > x, N = n) dx 

°° POO 

/ 2xP(W N > x\N = n)P(N = n)dx. 



However, on the set {N = n}, the Brownian motion sample path must cross 
zero during time interval (n — 1, n}. Let denote the cumulative distribution 
function for the standard normal distribution. For every t 6 [0, 1], we have 
the inequality 

P(W t > x\W t > 0, W = 0) = 2$(-x/v / t) < 2$(-x) Vx > 0. 
Then the strong Markov property easily implies that 

P(W N >x\N = n) = P{W n > x\ W n > 0, W n -x < 0, . . . , Wl < 0) 
< 2<f>(-x). 

Since N is finite with probability one, it follows that 

POO 

EWx< / Ax$(-x)dx < oo. 
Jo 

It remains to show the representation for M(u). It follows from Tanaka's 
formula that 

r N 

W N = W+= l {Wtm dW t + L™ N (0). 

J u 
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However, since the preceding proof already implies that E J u l{w t >o} dt < oo 
(and, hence, that the stochastic integral has zero mean), we have 

EW N =EL^ N (0) = M(u). 

This completes the proof. 
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